Search CORE

15 research outputs found

JOINT_FORCES : unite competing sentiment classifiers with random forest

Author: Cieliebak Mark
Dürr Oliver
Uzdilli Fatih
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2014
Field of study

In this paper, we describe how we created a meta-classifier to detect the message-level sentiment of tweets. We participated in SemEval-2014 Task 9B by combining the results of several existing classifiers using a random forest. The results of 5 other teams from the competition as well as from 7 general purpose commercial classifiers were used to train the algorithm. This way, we were able to get a boost of up to 3.24 F1 score points

ZHAW digitalcollection

Swiss-chocolate : sentiment detection using sparse SVMs and part-of-speech n-grams

Author: Cieliebak Mark
Jaggi Martin
Uzdilli Fatih
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2014
Field of study

We describe a classifier to predict the message-level sentiment of English microblog messages from Twitter. This paper describes the classifier submitted to the SemEval-2014 competition (Task 9B). Our approach was to build up on the system of the last year’s winning approach by NRC Canada 2013 (Mohammad et al., 2013), with some modifications and additions of features, and additional sentiment lexicons. Furthermore, we used a sparse (l1-regularized) SVM, instead of the more commonly used l2-regularization, resulting in a very sparse linear classifier

Crossref

ZHAW digitalcollection

A Twitter corpus and benchmark resources for german sentiment analysis

Author: Cieliebak Mark
Deriu Jan Milan
Egger Dominic
Uzdilli Fatih
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2017
Field of study

In this paper we present SB10k, a new corpus for sentiment analysis with approx.10,000 German tweets. We use this new corpus and two existing corpora to provide state-of-the-art bench-marks for sentiment analysis in German:we implemented a CNN (based on the winning system of SemEval-2016) and a feature-based SVM and compare their performance on all three corpora. For the CNN, we also created German word embeddings trained on 300M tweets. These word embeddings were then optimized for sentiment analysis using distant-supervised learning. The new corpus, the German word embeddings (plain and optimized), and source code to re-run the benchmarks are publicly available

Crossref

ZHAW digitalcollection

SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision

Author: Deriu Jan
Gonzenbach Maurice
Jaggi Martin
Luca Valeria De
Lucchi Aurélien
Uzdilli Fatih
Publication venue
Publication date: 01/01/2016
Field of study

In this paper, we propose a classifier for predicting message-level sentiments of English micro-blog messages from Twitter. Our method builds upon the convolutional sentence embedding approach proposed by (Severyn and Moschitti, 2015a; Severyn and Moschitti, 2015b). We leverage large amounts of data with distant supervision to train an ensemble of 2-layer convolutional neural networks whose predictions are combined using a random forest classifier. Our approach was evaluated on the datasets of the SemEval-2016 competition (Task 4) outperforming all other approaches for the Message Polarity Classification task

Infoscience - École polytechnique fédérale de Lausanne

Repository for Publications and Research Data

Crossref

Adverse drug reaction detection using an adapted sentiment classifier

Author: Cieliebak Mark
Egger Dominic
Uzdilli Fatih
Publication venue
Publication date: 01/01/2016
Field of study

ZHAW digitalcollection

Potential and limitations of commercial sentiment detection tools

Author: Cieliebak Mark
Dürr Oliver
Uzdilli Fatih
Publication venue: RWTH Aachen
Publication date: 01/01/2013
Field of study

In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. In addition to the quality analysis (measured by various metrics), we also investigate the effect of increasing text length on the performance. Finally, we show that combining all tools using machine learning techniques increases the overall performance significantly

ZHAW digitalcollection

Potential and Limitations of Commercial Sentiment Detection Tools

Author: Fatih Uzdilli
Mark Cieliebak
Oliver Dürr
Publication venue
Publication date: 01/01/2013
Field of study

Author names in alphabetic order Abstract. In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. In addition to the quality analysis (measured by various metrics), we also investigate the effect of increasing text length on the performance. Finally, we show that combining all tools using machine learning techniques increases the overall performance significantly

CiteSeerX

ZHAW digitalcollection

Twitter can help to find adverse drug reactions

Author: Cieliebak Mark
Egger Dominic
Uzdilli Fatih
Publication venue: European Research Consortium for Informatics and Mathematics
Publication date: 01/01/2016
Field of study

ZHAW digitalcollection

Back to the Future : Time-Travelling-Debugger als Alternative zu klassischen Debuggern

Author: Cieliebak Mark
Schutzbach Daniel
Uzdilli Fatih
Publication venue: Software-&-Support-Media
Publication date: 01/01/2015
Field of study

Time-Travelling-Debugger versprechen das Paradies für Software-Entwickler: Frei im Code vorwärts und rückwärts navigieren, nachträglich Log-Statements einfügen, an beliebige Zeitpunkte in der Code-Ausführung springen. Wir nehmen diese vielversprechenden Tools genauer unter die Lupe und beantworten folgende Fragen: Wie funktioniert die Technologie? Wo kann man einen Time-Travelling-Debugger am besten einsetzen? Und welche Einschränkungen gibt es in der Praxis?

ZHAW digitalcollection

Swiss-Chocolate: Sentiment Detection using Sparse SVMs and Part-Of-Speech n-Grams.

Author: Cieliebak Mark
Jaggi Martin
Uzdilli Fatih
Publication venue
Publication date: 21/06/2017
Field of study

We describe a classifier to predict the message-level sentiment of English micro-blog messages from Twitter. This pa-per describes the classifier submitted to the SemEval-2014 competition (Task 9B). Our approach was to build up on the sys-tem of the last year’s winning approach by NRC Canada 2013 (Mohammad et al., 2013), with some modifications and addi-tions of features, and additional sentiment lexicons. Furthermore, we used a sparse (`1-regularized) SVM, instead of the more commonly used `2-regularization, result-ing in a very sparse linear classifier.

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX